Automated Lunar Crater Identification with Chandrayaan-2 TMC-2 Images using Deep Convolutional Neural Networks

Terrestrial planets and their moons have impact craters, contributing significantly to the complex geomorphology of planetary bodies in our Solar System. Traditional crater identification methods struggle with accuracy because of the diverse forms, locations, and sizes of the craters. Our main aim is to locate lunar craters using images from Terrain Mapping Camera-2 (TMC-2) onboard the Chandrayaan-II satellite. The crater-based U-Net model, a convolutional neural network frequently used in image segmentation tasks, is a deep learning method presented in this study. The task of crater detection was accomplished with the proposed model in two steps: initially, it was trained using Resnet18 as the backbone and U-Net based on Image Net as weights. Secondly, TMC-2 images from Chandrayaan-2 were used to detect craters based on the trained model. The model proposed in this study comprises a neural network, feature extractor, and optimization technique for lunar crater detection. The model achieves 80.95% accuracy using unannotated data and precision and recall are much better with annotated data with an accuracy of 86.91% in object detection with TMC-2 ortho images. 2000 images have been considered for the present work as manual annotation is a time-consuming process and the inclusion of more images can enhance the performance score of the model proposed.

• Custom semantic segmentation based on the U-Net model with (Resnet18) is a novel approach to automate the detection of lunar craters, which has not been used before on the Mars-Lunar crater dataset.• The model uses a pre-trained Resnet18 as a backbone on the ImageNet dataset, which allows for faster and more efficient training.• The model leverages the best-performing pre-trained model, specifically trained on a vast and varied dataset, to achieve superior performance on the TMC-2 DEM images from the Chandrayaan-2 Mission.This ensures that the model has acquired comprehensive and broadly applicable characteristics to efficiently detect craters on the surface of the moon.The model gains from the previous knowledge and representations acquired from the ImageNet dataset, which includes a variety of object types, by utilizing the pre-trained model.• By sharing full-image convolutional features, the detection network and the region proposal network increase object detection accuracy.The pre-trained model's incorporation into the novel semantic segmentation method-which is based on the U-Net model-further improves detection powers.The fusion of the pretrained model's high-level features with the U-Net's ability to preserve detailed information facilitates accurate and comprehensive crater detection.• Overall, this approach allows for a more detailed and accurate representation of the lunar surface than was previously possible and has important implications for future exploration and scientific research.This fusion results in a very efficient method for utilizing the TMC-2 DEM data from the Chandrayaan-2 Mission to find craters on the lunar surface.
Following is an outline of this paper's structure: A thorough literature assessment is provided in "Historical background of the development of crater detection algorithm" section, with an emphasis on earlier studies in deep convolution neural networks and the detection of craters on Mars and the moon.A thorough discussion of relevant research is provided.The difficulties or obstacles that the data set presents are covered in "Dataset" section.An overview of the dataset used in the current study is given in "Dataset" section.The methodology is presented in "Model description, Methods, techniques, studied material, and area description" section, which offers a thorough justification of the selected course of action.The experimental results are given in "Results" section and discussed in "Discussion" section.Limitations of our model are provided in "Limitations" section."Conclusion" section offers concluding remarks, summarizing the findings and their implications while providing a comprehensive discussion of the research conclusions drawn and discusses the future scope of the research.

Historical background of the development of crater detection algorithm
There have been several research studies and publications related to this field.Here are some examples of related work on lunar crater detection described.The use of a type of artificial intelligence called CNNs to detect lunar craters on the moon was explored by DeLatte, et al. 20 .A technique was presented to train a CNN to automatically detect craters in photographs of the surface of the moon by learning its characteristics, which include its round form and sharp edges.Previous crater-based terrain relative navigation (TRN) systems identified craters from visual imagery using conventional image processing methods.These methods, however, are frequently not resistant to alterations in camera settings, angles for viewing, and illumination.Recent studies have demonstrated that by using advancements in computer vision and deep learning, automated crater detection may significantly enhance robustness 21 .CNN was introduced in 2012, and as neural network-based image segmentation methods gained popularity, the relevant evaluation procedures to assess the impact of craters in the charge coupled device (CCD) photos of Chang by Jia et al. 22 .A methodology for lunar crater detection and classification using CNNs was proposed.Utilization of a dataset of high-resolution lunar images and training a CNN model to identify and classify craters based on their size and morphology.Specializing in crater recognition on the lunar surface using the support vector machine (SVM) method, high accuracy in recognizing and categorizing lunar craters is obtained, indicating the efficacy of CNNs for this task.The SVM classifier utilized the points of interest and relevance of each crater to create a significant feature vector.The SVM vector was then utilized to create impact craters and classification criteria that identified impaling objects 23 .They extracted various features from lunar images, such as color, texture, and shape descriptors then trained an SVM model on a labeled dataset to detect craters.The study demonstrates the feasibility of using SVM for lunar crater detection and achieves promising results 23 .The work presents a methodology based on traditional image processing techniques for lunar crater detection.Filters, segmentation, and morphological operations to enhance lunar images and identify crater-like structures were employed.The study evaluates the effectiveness of different image processing techniques for crater detection and discusses their limitations compared to more advanced machine learning approaches.The study examines many deep-learning architectures for crater identification on the moon.The subject of study was the effectiveness of many CNN models, including AlexNet, VGGNet, and ResNet, on a sizable collection of images 24 .By leveraging the spatial distribution and characteristics of the clusters, they achieve reliable crater detection without the need for labeled training data.These related works highlight the application of CNN and alternative methodologies for lunar crater detection.Each study explores different techniques, ranging from traditional image processing methods to machine learning approaches, demonstrating the diversity of approaches in this field 25 .The findings assist in the development of reliable and effective methods for detecting lunar craters, which can facilitate exploration and research of the lunar surface-a technique for locating craters in lunar mare regions and determining their ages 26 .The method uses image processing techniques and machine learning algorithms to locate craters and estimate their ages based on physical features.Using image processing techniques, the photographs' contrast is enhanced, and the crater-specific characteristics are brought to light.The features are then categorized, and the craters are located using machine learning methods.A deep learning strategy based on grid partition for crater detection on the lunar surface was suggested by Hashimoto and Mori 27 .The process creates a grid out of the lunar surface and employs a convolutional neural network to find craters in each grid cell.To teach the network the characteristics of craters, a sizable data collection of lunar photographs was used for training.

Dataset
TMC-2 ortho and LRO NAC images (obtained from the RoboFlow) have been used during the training phase and testing phase of the proposed model while for validation of the model, only TMC-2 ortho images have been used.Technical details of TMC-2 and LRO NAC images are as follows: TMC-2: The Terrain Mapping Camera (TMC) is a high-resolution camera that was created to map the terrain of the Moon's near and far sides.Images for TMC-2 are sourced from ISRO's Chandrayaan-II (issdc.gov.in)Pradan Website 28 .TMC-2 is an instrument component onboard on Chandrayaan-II Mission.A thorough 3D atlas with a spatial and elevation resolution of 5 meters is being made as the main objective to better comprehend the lunar surface and the processes affecting it.TMC has a spatial resolution of 5 meters, a 20-km swath from 100 km lunar polar orbit, and functions throughout the 0.5-0.85m panchromatic spectral range.Fore, nadir, and aft views of the satellite's ground track are recorded by the camera using a push-broom imaging mode and three linear 4k element detectors.The fore and aft view angles are 25 degrees apart from the nadir.TMC has a dynamic range that can capture both newly formed crust rocks and fully developed mare soil, and it detects the solar radiation that is reflected or scattered from the Moon's surface.Leveraging the proven performance of this model, it is further utilized for detecting lunar craters using TMC-2 (Terrain Mapping Camera-2) images.Figure 1 shows the schematic of the principle of operations of TMC-2.FOV (Field of View) is the angular view which camera observes at a given time.
LRO NAC: The lunar reconnaissance orbiter (LRO) is equipped with a unique set of three cameras known as the lunar reconnaissance orbiter camera (LROC).The LROC captures high-resolution black-and-white images, along with moderate-resolution multi-spectral photos of the lunar surface, providing an invaluable resource for lunar exploration and research.The LROC primarily consists of two narrow-angle cameras (NACs).These cameras are specifically planned to produce 0.5 meter-scale panchromatic images across a stretch of 5 km on the moon's surface, enabling detailed visualization of the lunar topography.In addition to the NACs, the LROC also features a wide-angle camera (WAC).This camera is designed to generate images at a scale of 100 m/pixel in seven distinct color bands, covering a broad swath of 60 km on the lunar surface.This provides a broader context and an overview of the lunar landscape.Lastly, the sequence and compressor system (SCS) works in tandem with both the Narrow Angle and Wide-Angle Cameras, aiding in efficient data collection.The compressor makes it possible to manage and process the vast amounts of data generated by the high-resolution cameras, ensuring seamless operation of the LROC system 30 .
As annotated TMC-2 images were unavailable, the approach was trained and tested on Mars and Lunar crater images obtained from the RoboFlow augmentation model.This dataset includes images and labels of the surfaces of Mars and the Moon, which may contain craters.The data source has a mixture of Mars images from ASU and USGS, and moon images are from NASA's Lunar Reconnaissance Orbiter mission LROC NAC 31,32 .The public dataset of Mars-Lunar craters had been annotated for object detection purposes and sourced from Roboflow Universe 33 .The images have been pre-processed using RoboFlow to remove EXIF rotation and resize to 512 × 512.Each image comes with a labeling file in YOLOv5 text format that we created ourselves for object detection purposes.Additionally, we provide a trained YOLOv5 model file for each new version of the dataset using the latest version of the data.The current network structure used is YOLOv5m6.We have used 143 images of the Mars_Lunar dataset 33 for training when performing a train-test-validation split and craters are annotated in YOLO v5 PyTorch format.The total number of craters in the training dataset is 1034.Mars_Lunar dataset is used to illustrate the efficiency of transfer learning in the performance of the model as well as make our model generic.
Lunar Dataset Details: The model was trained using only the lunar dataset for the unannotated TMC 2 dataset.For which the details are provided below: Normalization is performed to scale the image pixel values to a common range, such as [0,1] or [− 1,1].This phase reduces the impact of lighting and contrast differences, allowing the model to concentrate on learning structural features rather than pixel intensity variations.Data augmentation techniques, including rotation, scaling, inversion, and the addition of random noise, are used to generate additional training images and increase dataset diversity.We divided the annotated dataset into multiple subsets or folds using k-fold cross validation, to ensure that the model is trained on a substantial portion of the data and that distinct images are available for objective evaluation.To prevent performance bias, care is taken to sustain the distribution of crater and noncrater images in both sets.Roboflow is used for augmenting the dataset which means augmentation is applied to existing images in the dataset to improve the ability of the model to generalize and perform well for new or unseen images.Augmentations are grouped and randomized with settings and values for each setting are applied to each augmented image.Duplicate images are also filtered in the process.Image augmentation is applied to only the training dataset 35 .
Roboflow helps in labeling/annotating the craters by using the bounding box.A bounding box, or the rectangle that surrounds an object in an image, defines each object.The width, height, and x and y coordinates of the top-left corner of the box, along with its coordinates, define the bounding box 36 .
• The discovered object's class label (all objects in this instance have the same label, 0).
• The bounding box's top-left corner's x-coordinate, normalized by the image's width • The bounding box's top-left corner's y-coordinate, normalized by the image's height • The bounding box's width, adjusted for the image's width • The bounding box height, scaled according to the image's height However, the Roboflow library is commonly used for data augmentation and pre-processing in computer vision tasks and may have been used in the training pipeline for the object detection model.Roboflow is a platform for managing, annotating, and augmenting image datasets.It provides various tools for data annotation, data pre-processing, and data augmentation.The platform supports various popular image formats and provides an easy-to-use interface for managing datasets.Roboflow also offers extensive integration with popular deeplearning frameworks like TensorFlow, PyTorch, and Keras.On the other hand, Albumentations is a Python library for image augmentation in machine-learning experiments.It provides various image transformation techniques like flipping, rotating, scaling, cropping, and color changes.It is designed to be fast, flexible, and easy to integrate with popular deep-learning frameworks like PyTorch and TensorFlow.It supports both CPU and GPU processing, and its augmentations are compatible with both NumPy and PyTorch tensors.Roboflow is more of a platform for managing and pre-processing image datasets, while the Albumentations library is specifically focused on image augmentation for machine learning.
Since our dataset comprises different quality images (Mars and Lunar), the model is trained for various image qualities.Due to this our model can be tested by any quality of image.
Figure 4a is the original image of the Moon surface from the Mars_Lunar Dataset. Figure 1b presents the labeled feature using Roboflow, the X and Y-axis are the pixel values of the image where each pixel is 0.5-2 m.A dataset of 2000 TMC 2 images has been annotated with crater locations and sizes by human experts and is currently used for training and testing crater detection algorithms.The dataset covers various regions of the Moon, including the nearside and the farside, and different types of terrain, such as mare, highlands, and polar areas.The dataset is divided into 24 subsets, each contains 83 or 84 images, for cross-validation.Cross-validation is a technique that splits the dataset into multiple subsets and evaluates the performance of the algorithm on each subset, using the rest of the data for training.This helps to reduce the bias and variance of the evaluation and to estimate the generalization ability of the algorithm.
Different types of difficulties present in the image for accurately identifying their features: 1. Overlapping, nested, and small-shaped lunar craters present difficulties in accurately identifying their features.Their description is given in Table 1.Ensuring consistency is challenging for quantifying the data into algorithms.2. Identifying and measuring parameters defining crater shape accurately is complex.Challenges arise in distinguishing craters from other formations, determining various parameters, and addressing limitations in lighting conditions and data quality.Morphological characteristics on the lunar surface presenting challenges in the identification of craters are analyzed in Fig. 5. 3. Lack of Ground Truth: The absence of annotated data means that there is no ground truth for model training and evaluation.The models must rely on inherent patterns in the data to learn and generalize lunar surface features.4. Visualizing craters across diverse datasets is complicated by variations in lighting and shadow rendering regions which is shown in Table 2. Accurately defining crater boundaries and coverage becomes complex due    We have trained our model on NASA lunar images which are labeled and annotated and fine-tuned it to detect craters in lunar images taken by ISRO, which are unlabelled.
• We used Transfer Learning on a pre-trained model to identify craters in the images from TMC-2 37 .
• After all the pre-processing steps including Canny Edge Detection and Hough transform, the U-Net model predicts a mask where each pixel is labeled as "crater" or "non-crater".This is usually done for crater spots ranging between 10 and 40 px with the help of this we can obtain craters starting range is 10 m. • The location and dimension of each of the craters found in the image using post-processing of the image, are then converted into geographical coordinates, thus the repetitive craters after resolution in different images can reappear which are separated based on geographical image coordinates on the pixel map.

Methodology
Deep Learning The primary objective of deep learning, a branch of machine learning, is to train multi-layered artificial neural networks to understand and predict complicated data.Speech recognition, natural language processing, and computer vision are just a few of the fields it has transformed.By automatically deriving hierarchical representations from data, deep learning models may extract significant characteristics and generate precise predictions.Deep learning models acquire the ability to automatically extract features and patterns from the input data during the training phase.The technique known as backpropagation is used to do this, and it entails

Shadowed region
Lighting region modifying the neural network's weights and biases to reduce the discrepancy between the true and expected outputs.The model can identify the ideal set of parameters through this optimization process, which best captures the underlying patterns in the data 26 .Computer vision tasks including segmentation, object identification, and picture classification have been transformed by deep learning.Image identification challenges have shown amazing accuracy for convolutional neural networks (CNNs), a sort of deep learning architecture particularly built for evaluating visual input.Deep learning does, however, present several difficulties.Both a large amount of labeled data and a significant amount of processing power are needed for deep neural network training.Deep learning algorithms are prone to overfitting due to their complexity, which occurs when the model performs well on training data but not on fresh, unknown data.Regularization techniques and larger datasets are often employed to mitigate overfitting.Overall, deep learning has had a transformative impact on various fields, enabling breakthroughs in image analysis, natural language understanding, and many other areas.It continues to advance the frontiers of AI and holds promise for solving complex problems and driving innovation in the future.Figure 7 illustrates the model architecture designed specifically for crater detection here in the figure using Mars_Lunar dataset images.The U-Net architecture is used, which has an encoder route that records high-level characteristics while progressively decreasing the spatial resolution.The decoder route, on the other hand, oversees upsampling the features to return the original spatial resolution.By adding skip links between the respective levels of the encoder and decoder, it is possible to combine high-level and low-level data, which guarantees precise and thorough crater identification in Mars_Lunar dataset images.U-Net A common architecture for image segmentation tasks is U-Net.It comprises a decoder pathway that upsamples the features to regain the spatial resolution and an encoder pathway that progressively diminishes the spatial resolution while collecting high-level information 38 .U-Net joins equivalent levels of the encoder and decoder via skip links, allowing low-level and high-level information to be combined.It has been extensively utilized for applications like object identification and biological picture segmentation.Overall, U-Net's architecture, with its skip connections, symmetric design, efficient parameter usage, and versatility, has established itself as a powerful tool for image segmentation tasks, providing accurate and detailed segmentation results in various domains.The authors use this model for training the Mars and Lunar images.Modified U-Net architecture 39 developed by (Olaf Ronneberger et al.) has been used in our proposed model.Figure 8 illustrates the model developed by Olaf Ronneberger.
FCNN (Fully Convolutional Neural Network) FCNN is a type of neural network architecture designed for pixellevel prediction tasks such as semantic segmentation.Unlike traditional convolutional neural networks (CNNs) that output a single label for the entire input image, FCNNs produce dense predictions by preserving the spatial

Hybrid ResNet UNet model for crater detection in CH2 TMC2 images
To fulfill our objective we divide our model into two parts: (1) For training we utilize U-Net architecture based on resnet18 as the backbone of annotated images and labels obtained from the Roboflow augmentation library used especially for image augmentation purposes, (2) To detect craters, we have deployed a model, to achieve efficient detection of lunar craters on images acquired by TMC-2 on board the Chandrayaan-2 Mission.
In our proposed work, we implement a modified version of the UNET architecture, as shown in Fig. 9.The architecture comprises two main parts: an encoder on the left that reduces the spatial dimensions of the input, and a decoder on the right that restores the output to the original size.The two parts are connected by multilevel skip connections that transfer features across different scales.We use the Mars_Lunar crater dataset, which contains 143 images and corresponding labels in YOLO v5 PyTorch format, exported from Roboflow 33 .
In the proposed model custom implementation of the semantic segmentation U-Net architecture is performed.Using a symmetric expanding path to permit exact localization and a contracting path to collect context, U-Net is a popular deep-learning architecture for image segmentation problems.ResNet18, a well-liked deep convolutional neural network design with better accuracy while training on huge datasets than earlier models, serves as the foundation for this application.The model is optimized for the particular purpose of semantic segmentation using a cross-entropy loss function, having been pre-trained on ImageNet.Our model is trained using Customize semantic segmentation based on U-Net with ResNet18 as the backbone and ImageNet used as weights, for binary segmentation of images with 6 input channels and 2 output classes.The model is classified into two backgrounds and a crater.This specifies that weights pre-trained on ImageNet will be employed to initialize the pre-trained weights of the backbone network used in the U-Net design.This is useful because Image Net is a large-scale dataset with a diverse set of image categories, and pre-training on it can help the model learn better feature representations.We trained and tested our model on Google Colab Pro+ utilizing a GPU unit to achieve better computational performance.In this case, we used Python 3.7.4version for the detection part, which corresponds to the 2nd part of our model.We can benefit from the characteristics that the model learned during the pre-training phase and refine them for the item identification task at hand by employing a pre-trained ResNet-50 backbone.The term "K-folds = 5" describes a kind of cross-validation method in which the data is divided into five "folds, " or equal-sized subgroups.The procedure is then done five times, using each fold as a test set once, with the model being trained on four of the folds and tested on the remaining fold.This • medium: This means that the code will detect craters that are between 5 and 15 pixels in diameter, which corresponds to craters between 1.25 and 3.75 km in real life.• small: This means that the code will detect craters that are less than 5 pixels in diameter, which corresponds to craters smaller than 1.25 km in real life.• large: This means that the code will detect craters that are larger than 15 pixels in diameter, which corresponds to craters larger than 3.75 km in real life.
The Max detections parameter is used to limit the number of bounding boxes that the model predicts for each image.It is set to 100 by default.This means that the model will output at most probable 100 bounding boxes for each image, regardless of how many craters are present in the image.The max detection parameter affects the performance of the crater detection algorithm in two ways: • If the max detections are too low, the model might miss some craters that are present in the image, resulting in lower recall and precision scores.Recall is the fraction of true craters that are detected by the model, and precision is the fraction of detected craters that are true craters.• If the max detections are too high, the model might output too many bounding boxes that are not craters, resulting in lower precision and higher inference time.
Inference time is the time it takes for the model to process an image and output the bounding boxes.Therefore, the optimal value of the max detection parameter depends on the distribution and density of craters in the images, as well as the trade-off between accuracy and speed.
The different values of IOU (Intersection over Union) are used to measure the accuracy of the object detection model.IOU is the ratio of the area of overlap between the predicted bounding box and the ground truth bounding box to the area of union between them.A higher IOU means a better match.
COCO metrics are evaluation criteria for object detection models.They were first proposed in the Microsoft COCO challenge by Lin et al. and have since been the standard evaluation criteria for object detection models.The code uses the COCO evaluation metric, which computes the average precision (AP) for different IOU thresholds.The rows of the result table show the AP for different IOU values, such as: (a) 0.50:0.95:This is the mean AP over IOU thresholds from 0.50 to 0.95 with a step size of 0.05.This is the most comprehensive and robust metric, as it evaluates the model across a wide range of IOU values.(b) 0.50: This is the AP for an IOU threshold of 0.50, which is equivalent to the PASCAL VOC metric.This is a less strict metric, as it only requires a 50% overlap between the predicted and ground truth bounding boxes.(c) 0.75: This is the AP for the IOU threshold of 0.75, which is a stricter metric, as it requires a 75% overlap between the predicted and ground truth bounding boxes.This evaluates the model's ability to produce precise localization.

Analysis of data without annotation
Preliminary results indicate that the models trained on data without annotations exhibit lower average recall, precision, F1 measure, and accuracy compared to their counterparts trained on annotated datasets.Our findings underscore the significance of annotated data in training robust models for lunar surface analysis.While approaches show promise, further research is needed to address the unique challenges posed by the absence of ground truth labels.Applying k-fold cross-validation is particularly beneficial when working with limited data, as it helps ensure that the model's performance evaluation is more representative and less dependent on a specific random split of the data.It also provides insights into the model's stability and generalization across different subsets of the data.
Table 3 presents the average recall of subsets, showcasing the model's performance.Table 4 then illustrates the average precision of subsets in the proposed lunar crater detection model.Meanwhile, Table 5 provides insights into the model's performance through F1 score and accuracy, complementing the already obtained average recall and precision values for subsets.The analysis reveals an accuracy of 80.96 and an F1 score of 0.7519.
Figure 10 is a line graph depicting the performance of the proposed model using unrelated data.This graphical representation includes average precision, recall, F1 score, and accuracy across all epochs of subsets, with values ranging from zero to one.
In Table 6, the images originate from the TMC 2 sensor of Chandrayaan 2. The coordinates corresponding to these images are provided in Figs. 1 and 2. Notably, these images are unannotated and serve as new input for the proposed model to identify lunar craters.The ground truth craters are indicated with red circles, and it's essential to note that the X and Y axes denote pixel values within the image, not the actual geographical coordinates.
In the "Prediction Result" column, the left image represents the original input, while the right image displays a heat map illustrating the model's predictions for crater detections.

Analysis of data without annotation
However, the manual annotation of lunar surface data, a crucial step in training models, is a labor-intensive process, often constrained by time and resources.As a result, the availability of annotated data remains limited, presenting a challenge in achieving optimal model performance.Our focus lies on improving the performance of models designed for lunar surface analysis, where the acquisition of annotated data is a bottleneck.This study delves into a multifaceted approach to address the scarcity of annotated lunar surface data.By employing a combination of machine learning and deep learning techniques, data augmentation, and strategic annotation efforts, we aim to optimize model performance and enhance the accuracy of lunar crater identification.
Table 7 highlights the average recall of subsets, showcasing the model's performance.Table 8 illustrates the average precision of subsets in the proposed lunar crater detection model.Meanwhile, Table 9 provides insights into the model's performance through F1 score and accuracy, complementing the already obtained average recall and precision values for subsets.In this analysis, the achieved accuracy is 86.91, accompanied by an impressive F1 measure of 0.7958.
Figure 11 is a line graph depicting the performance of the proposed model using unrelated data.This graphical representation includes average precision, recall, F1 score, and accuracy across all epochs, with values ranging from zero to one.

Predictions by the proposed model
The proposed crater prediction model is now trained on annotated data taken from Terrain Mapping Camera-2 (TMC-2) images of Chandrayaan-2 to explore lunar craters.The model trained using annotated data provided Table 6.Pictorial representation of the detection phase without annotation.The performance of the proposed model on unannotated and annotated data was according to our expectations as the performance of annotated data is more than that of unannotated data of TMC 2. We would further like to improve our accuracy by annotating more images and training the model with these images.

Limitations
Even though our offers valuable insights into the prediction of lunar crater identification have some limitations.The size of the image title used in the dataset is 256 × 256 whereas the size of the original images from the TMC-2 sensor varies.The image size is important for the model performance because it affects the speed and accuracy of the object detection.Generally, smaller images are faster to process but may lose some details or features that are relevant for identifying the craters.Larger images are slower to process but may capture more details or features that are helpful for object detection.Therefore, choosing an appropriate image size is a trade-off between speed and accuracy.Currently, we have trained our model with only 2000 annotated images of the TMC-2 sensor from the Chadrayaan-2 satellite 30 .Further, in extended work, we strive to find an acceptable balance between processing efficiency and detection precision by carefully selecting image sizes.We also intend to use ArcGIS Pro with DEM images to improve crater detection accuracy.Furthermore, an approach will be built involving slicing images into numerous tiles of varied sizes, followed by annotation, to correct the model's bias toward recognizing smaller craters.To assess our suggested methodology and investigate different competing approaches like YOLO, RCNN, and U-Net++, we will conduct extensive training, testing, and validation to design our model more robust to detect large as well as very small craters.To reduce the time consumption of the annotation procedure, we want to develop a qualitative heuristic equation for automatic bordering box generation.Our ultimate objective is to establish a reliable crater identification model by comparing different approaches.Despite obtaining promising accuracy rates of 80.95% (unannotated data) and 86.91% (annotated data), we are still devoted to improving our model's ability to identify a wider range of craters.

Conclusion
Our work presents a novel approach for automated lunar crater detection that employs a bespoke semantic segmentation model based on U-Net with a ResNet50 backbone.For effective training, the model combines a pre-trained ResNet18 on ImageNet and is applied to TMC-2 ortho images from the Chandrayaan-2 satellite.The model, which was trained on a heterogeneous dataset that included Martian and Lunar crater images from multiple sources, makes use of YOLOv5 and Roboflow for data annotation and preprocessing.On unannotated data, the model performs well, achieving 80.95% accuracy and a 0.7519 F1 measure.The accuracy rises to 86.91% on an annotated dataset, with a 0.79589 F1 measure, demonstrating the model's capacity to generalize from labeled data.Understanding challenges like the requirement for better image annotation, the model's accuracy can be improved particularly in recognizing untrained crater portions by applying new tactics such as YOLO v7, YOLO v5, point-based detection approaches, Faster RCNN, and U-Net++.The presented model offers promising results for automating lunar crater detection, contributing valuable insights to lunar exploration and scientific research, and can be further used as an input for the Lunar surface age detection algorithm.The performance of the proposed model can be further improved by using TMC-2 DEM images and Chandrayaan-2 OHRC images.Annotation will be performed in ArcGIS software for accurately locating the craters in DEM images.Further in the study, using the data available from Mangalyaan and Chandrayaan-2, training will be done on the same model as well and different models will also be tested.This will also be done to mitigate over-fitting challenges.As well as to cover the study area i.e.Vallis Schroter which focuses on locating tiles representing the Largest Rille on the Moon.
Total images-3556 • Training set-2310 images i.e., 65% • Testing set-890 images i.e., 25% • Validation set-356 images i.e., 10% • Total number of craters-7048 • Image size-416X416 • The Lunar Dataset collected to train the model proposed is from Roboflow 34 Chandrayaan 2 Dataset Details: The model was then trained-tested and validated using only the lunar dataset for the annotated TMC-2 dataset.For which the details are provided below: • Total images-500 images augmented to 2000 images.• Training set-1400 images i.e., 70% • Testing set-400 images i.e., 20% • Validation set-200 images i.e., 10% • Subsets-24 each containing either 83 or 84 images The model was tested using Chandrayaan 2 TMC 2 data from random locations of the moon.Images that were tested were of different locations.Some samples used are given in Figs. 2 and 3 with their coordinates.Data preprocessing is a very crucial step in the development of a deep learning model.It involved preparing image datasets for training, augmentation to increase the diversity of data, and implementing techniques like k-fold cross-validation to ensure robust evaluation of the model.To ensure uniformity and standardization, the images are resized to a particular resolution, such as 256 × 256 or 512 × 512 pixels.

Figure 4 .
Figure 4. (a) Presents the original image of the Lunar surface and (b) describes annotations.
crater on which other craters are partially or overlapped on it 2 Nested crater A primary crater formed and smaller secondary craters from inside it 3 Small craters Circular depressions on the surface of celestial bodies are formed by the impact of small meteoroids or asteroids.Usually less than 15 km (9 miles) in diameter and have a simple bowl-shaped structure.They are more abundant than large craters Vol:.(1234567890)Scientific Reports | (2024) 14:8231 | https://doi.org/10.1038/s41598-024-58438-4www.nature.com/scientificreports/Model description, methods, techniques, studied material, and area description Proposed model description The deep learning model was divided into two parts: initially, it was trained using the Resnet18 technique as the backbone and U-Net based on Lunar Images obtained from LRO NASA's mission 34 for obtaining a Pre-trained Model.With the help of Transfer Learning, the pre-trained model and the Canny Edge Detection Algorithm bounding boxes were used to identify circular objective craters in the images from TMC-2 37 .Figure 6 depicts the overall flowchart of the proposed model.

Figure 6 .
Figure 6.Flowchart of the proposed model.

Figure 7 .
Figure 7. Model architecture of the designed approach for crater detection.

Table 1 .
Types of craters causing difficulties.

Table 2 .
Different visualization regions cause difficulties.

Table 5 .
Performance of model based on F1-score and accuracy.The Line graph displays the performance of the proposed model using unannotated data.